Sequence  Description                                   Score    E-value   N
7tm_1     PF00001 7 transmembrane receptor (rhodopsin   119.9    4.7e-37   1

Parsed for domains:

Sequence  Domain  seq-f  seq-t     hmm-f  hmm-t     score    E-value
7tm_1     1/1        57    321 ..      1    259 []  119.9    4.7e-37
Alignments of top-scoring domains:

7tm_1: domain 1 of 1, from 57 to 321: score 119.9, E = 4.7e-37

             *->GNlLVilvilrtkklrtptnifilNLAvADLLfllllppwalyylvg
                GN+ ++++++r +++r +t +++lNL ADLLf + p++ ++ ++
  Flh14273,  57 GNVCALVLVAR-RRRRGATACLVLNLFCADLLFISAIPLVLAVR-WT 101

                gaadWpfGsalCklvtaldvvnmyaSillLtalSiDRYlAIvhPlryrrr
                  e W++G++ C+l+ +++++++ + il+L+a S++R + Iv l+ +r
  Flh14273, 102 --EAWLLGPVACHLLFYVMTLSGSVTILTLAAVSLERMVCIV-HLQRGVR 148

                rtsprrAkvvlllvWvlalllslPpllfswvktveegngtlnvnvtvCli
                +r +V+++l+W +++++lP +f+ v+ ++ ++ ++ +C++
  Flh14273, 149 GPGRRARAVLLALIWGYSAVAALPLCVFFRVVPQRLPG--ADQEISICTL 196



                dfpeestasvstwlrsyvllstlvgFllPllvilvcYtrIlrtlr.....
                +P++++ ++S+ +++ ++ Fl+P lvi++ Y+ Il + + ++++
  Flh14273, 197 IWPTIPG------EISWDVSFVTLNFLVPGLVIVISYSKILQITKasrkr 240

                ....................kaaktllvvvvvFvlCWlPyfivllldtlc
                                       +tl++++v F++ W P  i++ll +
  Flh14273, 241 ltvslayseshqirvsqqdfRLFRTLFLLMVSFFIMWSPIIITILLILIQ 290

                .lsiimsstCelervlptaillvtlwLayvNsclNPiIY<-*
                ++ + + + ++++ + ++++Ns+lNPi+Y
  Flh14273, 291 nFK QDLVIWPSLFFWVVAFTFANSALNPILY 321

FIG. 2. 
Analysis of Flh14273. (362 aa)

>Flh14273, 1086 bases, 1825 checksum.

MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS

FIG. 4. 
Transmembrane Segments Predicted by MEMSAT
out->ins


3.2 



>Flh14273,

MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC
ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL
SGSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVV
PQRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKR
LTVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWP
SLFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIIS
G

Transmembrane Segments for Presumed Mature Peptide

Start   End   Orient     Score
   14    37
   52    73   out->ins     4.0
   95   116   ins->out     4.9
  148   166   out->ins     3.7
  205   228   ins->out     6.5
  236   260   out->ins     3.2
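The segment coordinates above are numbered on the presumed mature peptide, not on the full-length protein. The following is a minimal sketch of how they could be mapped back to full-length numbering. It assumes that the mature peptide is simply the full-length sequence with an N-terminal stretch removed, which is what the two sequences listed for Flh14273 suggest. The names `FULL_PREFIX`, `MATURE_PREFIX`, `mature_offset` and `to_full_length` are hypothetical and chosen only for illustration.

```python
# First 120 residues of the full-length Flh14273 sequence and the first
# 20 residues of the "presumed mature peptide", as listed in this document.
FULL_PREFIX = (
    "MSPECARAAGDAPLRSLEQANRTRFPFFSDVKGDHRLVLAAVETTVLVLIFAVSLLGNVC"
    "ALVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTL"
)
MATURE_PREFIX = "LVLVARRRRRGATACLVLNL"

# (start, end) pairs from the mature-peptide table, 1-based and inclusive.
MATURE_SEGMENTS = [
    (14, 37), (52, 73), (95, 116), (148, 166), (205, 228), (236, 260),
]


def mature_offset(full, mature_prefix):
    """Return how many N-terminal residues precede the mature peptide."""
    index = full.find(mature_prefix)
    if index < 0:
        raise ValueError("mature peptide not found in full-length sequence")
    return index


def to_full_length(segments, offset):
    """Shift mature-peptide coordinates to full-length numbering."""
    return [(start + offset, end + offset) for start, end in segments]


OFFSET = mature_offset(FULL_PREFIX, MATURE_PREFIX)
FULL_SEGMENTS = to_full_length(MATURE_SEGMENTS, OFFSET)

for (m_start, m_end), (f_start, f_end) in zip(MATURE_SEGMENTS, FULL_SEGMENTS):
    print(f"mature {m_start:>3}-{m_end:<3} -> full length {f_start:>3}-{f_end:<3}")
```

Under this assumption the offset is 61 residues, so the first listed segment (14 to 37) would correspond to residues 75 to 98 of the full-length sequence.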



>Flh14273, mature

LVLVARRRRRGATACLVLNLFCADLLFISAIPLVLAVRWTEAWLLGPVACHLLFYVMTLS
GSVTILTLAAVSLERMVCIVHLQRGVRGPGRRARAVLLALIWGYSAVAALPLCVFFRVVP
QRLPGADQEISICTLIWPTIPGEISWDVSFVTLNFLVPGLVIVISYSKILQITKASRKRL
TVSLAYSESHQIRVSQQDFRLFRTLFLLMVSFFIMWSPIIITILLILIQNFKQDLVIWPS
LFFWVVAFTFANSALNPILYNMTLCRNEWKKIFCCFWFPEKGAILTDTSVKRNDLSIISG

FIG. 6. 
Query: 14273m,
Scores for sequence family classification (score includes all domains):
Sequence  Description                                   Score    E-value   N

7tm_1     PF00001 7 transmembrane receptor (rhodopsin   118.8    1e-36     1
Parsed for domains:

Sequence  Domain  seq-f  seq-t     hmm-f  hmm-t     score    E-value

7tm_1     1/1        57    321 ..      1    259 []  118.8    1e-36
Alignments of top-scoring domains:

7tm_1: domain 1 of 1, from 57 to 321: score 118.8, E = 1e-36

             *->GNlLVilvilrtkklrtptnifilNLAvADLLfllllppwalyylvg
                GN+ ++++++r +++r ++ +++lNL ADLLf+ + p++ ++ ++
  14273m,    57 GNVCALVLVAR-RRRRGASASLVLNLFCADLLFTSAIPLVLVVR-WT 101

                gaadWpfGsalCklvtaldvvnmyaSillLtalSiDRYlAIvhPlryrrr
                  e W++G+++C+l+ +++++++ + il+L+a S++R + Iv lr +
  14273m,   102 --EAWLLGPVVCHLLFYVMTMSGSVTILTLAAVSLERMVCIV-RLRRGLS 148

                rtsprrA.kvvlllvWvlalllslPpllfswvktveegngtlnvnvtvCl
                rr+++++++++W ++l++lP ++++ V + ++ ++ +C+
  14273m,   149 GP-GRRTqAALLAFIWGYSALAALPLYILFRVVPQRLPGGD--QEIPICT 195

                idfpeestasvstwlrsyvllstlvgFllPllvilvcYtrIlrtlr....
                +d+p++ + ++S+ +++ ++ Fl+P lvi++ Y+ Il + + +++
  14273m,   196 LDWPNRIG------EISWDVFFETLNFLVPGLVIVISYSKILQITKasrk 239



                .....................kaaktllvvvvvFvlCWlPyfivllldtl
                                        +tl++++v F++ W P  i++ll +
  14273m,   240 rltlslayseshqirvsqqdyRLFRTLFLLMVSFFIMWSPIIITILLILI 289

                c.lsiimsstCelervlptaillvtlwLayvNsclNPiIY<-*
                ++ + + + ++++ + ++++Ns+lNPi+Y
  14273m,   290 QnFR QDLVIWPSLFFWVVAFTFANSALNPILY 321



FIG. 8.
Analysis of 14273m. (362 aa)

>14273m, 1086 bases, 6943 checksum.

MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC
ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM
SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV
PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR
LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP
SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS

FIG. 10.
Prosite Pattern Matches for 14273m,

>PS00001/PDOC00001/ASN_GLYCOSYLATION N-glycosylation site.

Query: 21 NRTH 24
Query: 322 NMSL 325

>PS00002/PDOC00002/GLYCOSAMINOGLYCAN Glycosaminoglycan attachment site.
RU Additional rules:

RU There must be at least two acidic amino acids (Glu or Asp) from -2 to -4 relative to the serine.

Query: 148 SGPG 151

>PS00004/PDOC00004/CAMP_PHOSPHO_SITE cAMP- and cGMP-dependent protein kinase phosphorylation site.
Query: 239 KRLT 242

>PS00005/PDOC00005/PKC_PHOSPHO_SITE Protein kinase C phosphorylation site.

Query: 237 SRK 239
Query: 350 SVR 352

>PS00006/PDOC00006/CK2_PHOSPHO_SITE Casein kinase II phosphorylation site.

Query: 40 SVVE 43
Query: 256 SQQD 259

>PS00008/PDOC00008/MYRISTYL N-myristoylation site.

Query: 57 GNVCAL 62
Query: 72 GASASL 77
Query: 343 GAIFTD 348

>PS00009/PDOC00009/AMIDATION Amidation site.
Query: 150 PGRR 153
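
The pattern matches listed above can be approximated with regular expressions. The sketch below is illustrative only and is not the program that produced this listing. It assumes the standard PROSITE definitions N-{P}-[ST]-{P}, S-G-x-G and [ST]-x-[RK], translated by hand, and it uses the 14273m protein sequence as printed in this document. The names `SEQ`, `PATTERNS`, `scan` and `acidic_upstream` are hypothetical.

```python
import re

# Mouse 14273m protein sequence as listed in this document (361 residues).
SEQ = (
    "MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC"
    "ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM"
    "SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV"
    "PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR"
    "LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP"
    "SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS"
    "S"
)

# PROSITE patterns translated by hand into regular expressions (assumption).
PATTERNS = {
    "ASN_GLYCOSYLATION": r"N[^P][ST][^P]",  # N-{P}-[ST]-{P}
    "GLYCOSAMINOGLYCAN": r"SG.G",           # S-G-x-G
    "PKC_PHOSPHO_SITE": r"[ST].[RK]",       # [ST]-x-[RK]
}


def scan(seq, regex):
    """Return (start, end, match) tuples, 1-based inclusive, overlaps allowed."""
    hits = []
    # The lookahead lets the scan report matches that overlap each other.
    for m in re.finditer(f"(?=({regex}))", seq):
        text = m.group(1)
        hits.append((m.start() + 1, m.start() + len(text), text))
    return hits


def acidic_upstream(seq, ser_pos):
    """Count Asp/Glu at positions -2 to -4 relative to a 1-based serine."""
    window = seq[max(ser_pos - 5, 0):ser_pos - 2]
    return sum(residue in "DE" for residue in window)


for name, regex in PATTERNS.items():
    for start, end, text in scan(SEQ, regex):
        print(f"{name}: Query: {start} {text} {end}")

# Additional rule for the glycosaminoglycan site at serine 148.
print("acidic residues at -2 to -4:", acidic_upstream(SEQ, 148))
```

On this sequence the glycosaminoglycan pattern matches SGPG at 148 to 151, but no Asp or Glu is found at positions -2 to -4, so the additional rule quoted in the listing would not be satisfied for that match.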



FIG. 11. 
Transmembrane Segments Predicted by MEMSAT
>14273m,

MSPECAQTTGPGPSHTLDQVNRTHFPFFSDVKGDHRLVLSVVETTVLGLIFVVSLLGNVC
ALVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTM
SGSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVV
PQRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKR
LTLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWP
SLFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVIS
S

Transmembrane Segments for Presumed Mature Peptide 
>14273m, mature

LVLVARRRRRGASASLVLNLFCADLLFTSAIPLVLVVRWTEAWLLGPVVCHLLFYVMTMS
GSVTILTLAAVSLERMVCIVRLRRGLSGPGRRTQAALLAFIWGYSALAALPLYILFRVVP
QRLPGGDQEIPICTLDWPNRIGEISWDVFFETLNFLVPGLVIVISYSKILQITKASRKRL
TLSLAYSESHQIRVSQQDYRLFRTLFLLMVSFFIMWSPIIITILLILIQNFRQDLVIWPS
LFFWVVAFTFANSALNPILYNMSLFRNEWRKIFCCFFFPEKGAIFTDTSVRRNDLSVISS



FIG. 12. 



